Suffolk
Neanderthals harnessed fire 350,000 years earlier than previously thought
'This is the most remarkable discovery of my career.' Breakthroughs, discoveries, and DIY tips sent every weekday. Evidence uncovered in a field in Suffolk, England indicates that ancient humans intentionally harnessed fire more than 350,000 years earlier than previously believed. According to a British Museum-led study published on December 10 in the journal, our Paleolithic Neanderthal ancestors utilized technology like hearths and campfires as much as 400,000 years ago. "The implications are enormous," British Museum project curator and study coauthor Rob Davis said in a statement .
- Media > Photography (0.31)
- Health & Medicine (0.31)
Can LLMs be Scammed? A Baseline Measurement Study
Sehwag, Udari Madhushani, Patel, Kelly, Mosca, Francesca, Ravi, Vineeth, Staddon, Jessica
Despite the importance of developing generative AI models that can effectively resist scams, current literature lacks a structured framework for evaluating their vulnerability to such threats. In this work, we address this gap by constructing a benchmark based on the FINRA taxonomy and systematically assessing Large Language Models' (LLMs') vulnerability to a variety of scam tactics. First, we incorporate 37 well-defined base scam scenarios reflecting the diverse scam categories identified by FINRA taxonomy, providing a focused evaluation of LLMs' scam detection capabilities. Second, we utilize representative proprietary (GPT-3.5, GPT-4) and open-source (Llama) models to analyze their performance in scam detection. Third, our research provides critical insights into which scam tactics are most effective against LLMs and how varying persona traits and persuasive techniques influence these vulnerabilities. We reveal distinct susceptibility patterns across different models and scenarios, underscoring the need for targeted enhancements in LLM design and deployment.
- North America > United States > Virginia (0.04)
- North America > United States > Pennsylvania (0.04)
- North America > United States > Minnesota (0.04)
- (4 more...)
- Media (1.00)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- (6 more...)
CroissantLLM: A Truly Bilingual French-English Language Model
Faysse, Manuel, Fernandes, Patrick, Guerreiro, Nuno M., Loison, António, Alves, Duarte M., Corro, Caio, Boizard, Nicolas, Alves, João, Rei, Ricardo, Martins, Pedro H., Casademunt, Antoni Bigata, Yvon, François, Martins, André F. T., Viaud, Gautier, Hudelot, Céline, Colombo, Pierre
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a custom tokenizer, and bilingual finetuning datasets. We release the training dataset, notably containing a French split with manually curated, high-quality, and varied data sources. To assess performance outside of English, we craft a novel benchmark, FrenchBench, consisting of an array of classification and generation tasks, covering various orthogonal aspects of model performance in the French Language. Additionally, rooted in transparency and to foster further Large Language Model research, we release codebases, and dozens of checkpoints across various model sizes, training data distributions, and training steps, as well as fine-tuned Chat models, and strong translation models. We evaluate our model through the FMTI framework, and validate 81 % of the transparency criteria, far beyond the scores of even most open initiatives. This work enriches the NLP landscape, breaking away from previous English-centric work in order to strengthen our understanding of multilinguality in language models.
- Europe > Portugal > Lisbon > Lisbon (0.14)
- Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
- Europe > Belgium (0.04)
- (25 more...)
- Leisure & Entertainment (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area (1.00)
- (6 more...)
If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in History
Garcia, Giselle Gonzalez, Weilbach, Christian
The recent advent of powerful Large-Language Models (LLM) provides a new conversational form of inquiry into historical memory (or, training data, in this case). We show that by augmenting such LLMs with vector embeddings from highly specialized academic sources, a conversational methodology can be made accessible to historians and other researchers in the Humanities. Concretely, we evaluate and demonstrate how LLMs have the ability of assisting researchers while they examine a customized corpora of different types of documents, including, but not exclusive to: (1). primary sources, (2). secondary sources written by experts, and (3). the combination of these two. Compared to established search interfaces for digital catalogues, such as metadata and full-text search, we evaluate the richer conversational style of LLMs on the performance of two main types of tasks: (1). question-answering, and (2). extraction and organization of data. We demonstrate that LLMs semantic retrieval and reasoning abilities on problem-specific tasks can be applied to large textual archives that have not been part of the its training data. Therefore, LLMs can be augmented with sources relevant to specific research projects, and can be queried privately by researchers.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > Cuba (0.05)
- (37 more...)
- Education > Educational Setting > Higher Education (0.69)
- Health & Medicine > Therapeutic Area (0.46)
Foundations of Digital Arch{\ae}oludology
Browne, Cameron, Soemers, Dennis J. N. J., Piette, Éric, Stephenson, Matthew, Conrad, Michael, Crist, Walter, Depaulis, Thierry, Duggan, Eddie, Horn, Fred, Kelk, Steven, Lucas, Simon M., Neto, João Pedro, Parlett, David, Saffidine, Abdallah, Schädler, Ulrich, Silva, Jorge Nuno, de Voogt, Alex, Winands, Mark H. M.
Digital Archaeoludology (DAL) is a new field of study involving the analysis and reconstruction of ancient games from incomplete descriptions and archaeological evidence using modern computational techniques. The aim is to provide digital tools and methods to help game historians and other researchers better understand traditional games, their development throughout recorded human history, and their relationship to the development of human culture and mathematical knowledge. This work is being explored in the ERC-funded Digital Ludeme Project. The aim of this inaugural international research meeting on DAL is to gather together leading experts in relevant disciplines - computer science, artificial intelligence, machine learning, computational phylogenetics, mathematics, history, archaeology, anthropology, etc. - to discuss the key themes and establish the foundations for this new field of research, so that it may continue beyond the lifetime of its initiating project.
- Europe > Switzerland > Zürich > Zürich (0.14)
- Europe > Netherlands > Limburg > Maastricht (0.05)
- Europe > Portugal > Lisbon > Lisbon (0.04)
- (17 more...)
- Education (1.00)
- Leisure & Entertainment > Games > Chess (0.94)
SKYNET: an efficient and robust neural network training tool for machine learning in astronomy
Graff, Philip, Feroz, Farhan, Hobson, Michael P., Lasenby, Anthony N.
We present the first public release of our generic neural network training algorithm, called SkyNet. This efficient and robust machine learning tool is able to train large and deep feed-forward neural networks, including autoencoders, for use in a wide range of supervised and unsupervised learning applications, such as regression, classification, density estimation, clustering and dimensionality reduction. SkyNet uses a `pre-training' method to obtain a set of network parameters that has empirically been shown to be close to a good solution, followed by further optimisation using a regularised variant of Newton's method, where the level of regularisation is determined and adjusted automatically; the latter uses second-order derivative information to improve convergence, but without the need to evaluate or store the full Hessian matrix, by using a fast approximate method to calculate Hessian-vector products. This combination of methods allows for the training of complicated networks that are difficult to optimise using standard backpropagation techniques. SkyNet employs convergence criteria that naturally prevent overfitting, and also includes a fast algorithm for estimating the accuracy of network outputs. The utility and flexibility of SkyNet are demonstrated by application to a number of toy problems, and to astronomical problems focusing on the recovery of structure from blurred and noisy images, the identification of gamma-ray bursters, and the compression and denoising of galaxy images. The SkyNet software, which is implemented in standard ANSI C and fully parallelised using MPI, is available at http://www.mrao.cam.ac.uk/software/skynet/.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > United States > New York (0.04)
- North America > United States > Maryland > Prince George's County > Greenbelt (0.04)
- (4 more...)
BAMBI: blind accelerated multimodal Bayesian inference
Graff, Philip, Feroz, Farhan, Hobson, Michael P., Lasenby, Anthony
In this paper we present an algorithm for rapid Bayesian analysis that combines the benefits of nested sampling and artificial neural networks. The blind accelerated multimodal Bayesian inference (BAMBI) algorithm implements the MultiNest package for nested sampling as well as the training of an artificial neural network (NN) to learn the likelihood function. In the case of computationally expensive likelihoods, this allows the substitution of a much more rapid approximation in order to increase significantly the speed of the analysis. We begin by demonstrating, with a few toy examples, the ability of a NN to learn complicated likelihood surfaces. BAMBI's ability to decrease running time for Bayesian inference is then demonstrated in the context of estimating cosmological parameters from Wilkinson Microwave Anisotropy Probe and other observations. We show that valuable speed increases are achieved in addition to obtaining NNs trained on the likelihood functions for the different model and data combinations. These NNs can then be used for an even faster follow-up analysis using the same likelihood and different priors. This is a fully general algorithm that can be applied, without any pre-processing, to other problems with computationally expensive likelihood functions.
- North America > United States > New York (0.04)
- Europe > United Kingdom > England > Suffolk (0.04)
- Asia > Middle East > Israel > Haifa District > Haifa (0.04)
- Asia > India (0.04)
Modeling human function learning with Gaussian processes
Griffiths, Thomas L., Lucas, Chris, Williams, Joseph, Kalish, Michael L.
Accounts of how people learn functional relationships between continuous variables have tended to focus on two possibilities: that people are estimating explicit functions, or that they are simply performing associative learning supported by similarity. We provide a rational analysis of function learning, drawing on work on regression in machine learning and statistics. Using the equivalence of Bayesian linear regression and Gaussian processes, we show that learning explicit rules and using similarity can be seen as two views of one solution to this problem. We use this insight to define a Gaussian process model of human function learning that combines the strengths of both approaches.
- North America > United States > California > Alameda County > Berkeley (0.14)
- North America > Canada > Ontario > Toronto (0.14)
- North America > United States > New York (0.04)
- (5 more...)